Statistical Interpretation of Compound Nouns
Authors
Abstract
We present a method for detecting compound nominalisations in open data and deriving an interpretation for them. Discovering the semantic relationship between the modifier and the head noun in a compound nominalisation is construed first as a two-way disambiguation task between an underlying subject or object relation, and second as a three-way task between subject, direct object, and prepositional object relations. The detection method achieves about 89% recall on a data set annotated by way of Celex and Nomlex, and about 70% recall on a randomly sampled data set based on the British National Corpus, with 77% recall on detecting a more general set of compound nouns from this data. The interpretation method achieved about 72% accuracy in the two-way task and 57% in the three-way task, using a statistical measure based on z-scores (the confidence interval) to select one of the relations. Unlike previous research, our proposed method can act over open data to detect and interpret compound nominalisations, rather than operating only in a limited domain or requiring hand-selection or hand-tuning.
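As a rough illustration of how a z-score could drive the two-way subject/object decision, the sketch below tests whether a modifier noun occurs as the subject of the head's underlying verb significantly more or less often than chance. The abstract does not say how the z-scores are computed, so the null proportion of 0.5, the 1.96 threshold, the function names `z_score` and `interpret_two_way`, and the counts in the usage lines are all assumptions made for illustration, not the authors' procedure.

```python
import math


def z_score(k: int, n: int, p0: float = 0.5) -> float:
    """z-score for k 'subject' observations out of n, under null proportion p0.

    Uses the normal approximation to the binomial distribution.
    Returns 0.0 when there are no observations at all.
    """
    if n == 0:
        return 0.0
    return (k - n * p0) / math.sqrt(n * p0 * (1 - p0))


def interpret_two_way(subj_count: int, obj_count: int,
                      z_crit: float = 1.96,
                      default: str = "undecided") -> str:
    """Pick 'subject' or 'object' for a modifier-head pair (hypothetical helper).

    subj_count / obj_count: how often the modifier noun was seen as the
    subject / object of the verb underlying the head nominalisation.
    z_crit: assumed critical value (1.96 is roughly a 95% interval).
    """
    n = subj_count + obj_count
    z = z_score(subj_count, n)
    if z > z_crit:
        return "subject"
    if z < -z_crit:
        return "object"
    # The evidence falls inside the confidence interval: no clear preference.
    return default


# Illustrative counts only; they are not taken from the paper.
print(interpret_two_way(40, 10))
print(interpret_two_way(3, 30))
print(interpret_two_way(6, 5))
```

Returning a separate default when the score falls inside the interval keeps weak evidence from being forced into either relation. A three-way variant would need an additional count for prepositional objects and a different test.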
Related resources
An Analysis of Persian Compound Nouns as Constructions
In Construction Morphology (CM), a compound is treated as a construction at the word level with a systematic correlation between its form and meaning, in the sense that any change in the form is accompanied by a change in the meaning. Compound words are coined by compounding templates which are called abstract schemas in CM. These abstract constructional schemas generalize over sets of existing...
A Concept-Centered Approach to Noun-Compound Interpretation
A noun-compound is a compressed proposition that requires an audience to recover the implicit relationship between two concepts that are expressed as nouns. Listeners recover this relationship by considering the most typical relations afforded by each concept. These relational possibilities are evident at a linguistic level in the syntagmatic patterns that connect nouns to the verbal actions th...
A Workbench for Acquiring Semantic Information and Constructing Dictionary for Compound Noun Analysis
This paper describes a workbench system for constructing a dictionary to interpret compound nouns, which integrates the acquisition of semantic information and the interpretation of compound nouns. First, we extract semantic information from a machine-readable dictionary and corpora using regular expressions. Then, the semantic relations of compound nouns are interpreted based on semantic relations,...
Corpus-Based Approach for Nominal Compound Analysis for Korean Based on Linguistic and Statistical Information
Accurate nominal compound analysis is crucial for applications of natural language processing such as information retrieval and extraction, as well as for nominal compound interpretation. In the nominal compound analysis area, some corpus-based approaches have reported successful results by using statistical co-occurrences of nouns. But a nominal compound often has a similar structure to a simpl...
Noun compounds
A noun-compound is a compressed proposition that requires an audience to recover the implicit relationship between two concepts that are expressed as nouns. Listeners recover this relationship by considering the most typical relations afforded by each concept. These relational possibilities are evident at a linguistic level in the syntagmatic patterns that connect nouns to the verbal actions th...
Journal title:
Volume, issue:
Pages: -
Publication date: 2005